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Purpose: Recent developments in genomics have led to expanded 
carrier screening panels capable of assessing hundreds of causal 
mutations for genetic disease. This new technology enables simulta- 
neous measurement of carrier frequencies for many diseases. As the 
resultant rank-ordering of carrier frequencies impacts the design and 
prioritization of screening programs, the accuracy of this ranking is 
a public health concern. 

Methods: A total of 23,453 individuals from many obstetric, genet- 
ics, and infertility clinics were referred for routine recessive disease 
carrier screening. Multiplex carrier screening was performed and 
results were aggregated for this study. 

Results: Twenty-four percent of individuals were identified as 
carriers for at least one of 108 disorders, and 5.2% were carriers for 



multiple disorders. We report tabulations of carrier frequency by self- 
identified ethnicity and disease. 

Conclusion: To our knowledge, this study of a large, ethnically 
diverse clinical sample provides the most accurate measurements to 
date of carrier frequencies for hundreds of recessive alleles. The study 
also yields information on the clinical considerations associated with 
routine use of expanded panels and provides support for a pan-ethnic 
screening paradigm that minimizes the use of "racial" categories by 
the physician, as recommended by recent guidelines. 
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INTRODUCTION 

Carrier screening is the practice of diagnosing individuals and 
couples at risk of conceiving children affected by recessive dis- 
eases. This practice is widespread and growing in importance, 
as carrier couples identified before conception may decide to 
pursue a variety of preventive measures, including in vitro fer- 
tilization with preimplantation genetic diagnosis, use of donor 
gametes, adoption, or early prenatal diagnostic testing. 

In 2001 cystic fibrosis (CF) became the first disease recom- 
mended for routine carrier screening in the United States when 
the American Congress of Obstetrics and Gynecology (ACOG) 
and the American College of Medical Genetics (ACMG) jointly 
recommended that CF screening be offered to all individuals of 
Caucasian and Ashkenazi Jewish ethnicity and that it be "made 
available" to individuals of all other ethnic groups. 1 In 2005, 
in recognition of increasing population ethnic admixture, the 
ACOG Committee on Genetics stated it was reasonable to offer 
CF screening pan-ethnically. 2 

Routine carrier screening for spinal muscular atrophy 
(SMA) has become more common since 2008 when ACMG 



recommended that screening be offered pan-ethnically. 3 
However, SMA screening is still not performed as routinely as 
CF testing, 4,5 in part because of a conflicting ACOG statement 
that SMA should not be offered to the general population. 6 

In the absence of family history, carrier screening for diseases 
beyond CF and SMA is typically offered based on stated ethnic- 
ity. For example, hemoglobinopathy carrier screening is offered 
to individuals of African, Southeast Asian, or Mediterranean 
ancestry, 7 whereas a panel of 4-16 diseases is routinely offered 
to individuals of Ashkenazi Jewish ancestry. 810 

However, the advent of modern genomics has changed the 
health economic calculus around genetic screening, with costs- 
per-megabase for DNA sequencing falling at a rate faster than 
Moore's Law would indicate. 11 Consequently, the amortized cost 
of carrier screening across the population for hundreds of genetic 
diseases is now far less than the cost of treating an affected 
child. 121314 Genomics has also called into question the reifkation 
of "racial" categories in clinical guidelines, as high-throughput 
assays now allow a physician to prescribe the same test for every 
patient, regardless of ethnicity, with little or no increase in cost. 
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Here we present clinical data from a large-scale deployment of 
expanded carrier screening across dozens of practices. In addi- 
tion to providing perhaps the best estimates to date of a wide 
range of Mendelian allele frequencies, the deployment provides 
a concrete example of how genomics may reduce the role of 
"race" in biomedicine. It also identifies a number of severe, fre- 
quent, and easily detected disease alleles that have been omitted 
to date from population screening programs but that are easily 
included in expanded carrier testing panels. 

MATERIALS AND METHODS 

Multiplex platform 

The screening platform (Universal Genetic Test, Counsyl, South 
San Francisco, CA) uses high-throughput genotyping to identify 
disease- causing variants and corresponding wild-type alleles. In 
total, 417 disease-causing mutations associated with 108 recessive 
diseases were assayed and interpreted via fluorescent sequences. 
We consider a subset (listed in Supplementary Table SI online) 
of the diseases identifiable via the platform. The analytical per- 
formance has been reported previously and compares favorably 
with traditional single-gene testing methods. 12 Our reported dis- 
ease carrier frequencies are lower bounds, as there are variants 
for each disease that the assay platform does not test. 

Study population 

This study sample included 23,453 individuals. Referral sources 
included family practitioners, geneticists, genetic counselors, 
obstetricians, perinatologists, and reproductive endocrinologists. 
Documented informed consent, including consent to research, was 
required of all patients and is on record at our facility Institutional 
review board exemption is applicable due to de-identification of 
the data presented (45 CFR part 46.101(b)(4)). Genetic counsel- 
ing was made available at no cost to all tested individuals. 

Funding was performed as fee-for-service, typically reim- 
bursed through third-party payers. 

Disease panels 

Diseases identifiable via the test platform range in severity from 
mild to incompatible with life. Nearly every disease can be con- 
sidered severe (associated with infant or child mortality), signifi- 
cant (associated with progressive disease and reduced life span), 
or requiring significant intervention or treatment. The remain- 
der is composed of milder conditions characterized by high 
allele frequencies and reduced penetrances, e.g., factor V Leiden 
thrombophilia (OMIM 188055), glucose-6-phosphate dehydro- 
genase deficiency (OMIM 305900), HF£-associated hereditary 
hemochromatosis (OMIM 235200), prothombin thrombophilia 
(OMIM 176930), pseudocholinesterase deficiency (OMIM 
177400), and two common MTHFR mutations (OMIM 607093). 

As the testing platform permits customization of disease panels, 
some physicians chose to prescribe testing for a subset of the full 
417-mutation panel. In addition, ongoing improvements to the 
platform resulted in additions and refinements of various disease 
mutations. The sum effect of these changes is that the number of 
individuals tested for a given mutation varies moderately, as shown 



in the first column of Supplementary Table SI online. All OMIM 
numbers are also noted in Supplementary Table SI online. 

RESULTS 

Data were available from 23,453 screened patients. Routine 
screening for a possible carrier state was the indication in all 
individuals. Screening for other indications, including poten- 
tial gamete donor, infertility, and personal or family history, 
was also performed. Individuals with those indications were 
excluded from this study. 

Population demographics 

Self-reported ethnicity is summarized in Table 1. Caucasians 
constituted 60.6%, and 75.0% were female. Most samples came 
from within the United States; a minority came from Australia, 
Canada, England, or New Zealand. 

The median age was 33.0 years and mean age was 33.63 years. 
Most (n = 17,865) of the individuals were from 21-39 years 
of age. A small percentage was under 21 years (1.6%) or over 
45 years (2.8%). 

Disease carrier states 

Alleles associated with 96 recessive diseases (excluding mild con- 
ditions) were identified in our 23,453 patients (Supplementary 
Table SI online). Among the mild conditions, the most com- 
mon was MTHFR deficiency, with a frequency of 1 in 1.9. 
Because these conditions typically have limited reproductive 
decision-making significance, they will not be considered fur- 
ther in this analysis. All subsequent analysis considers only 
those diseases listed in Supplementary Table SI online. 

Of the total sample, 24.0% of individuals (n = 5,633) were 
heterozygous for at least one non-mild condition. A total of 
7,067 heterozygous states were identified. Carrier statistics are 
fully reported in Supplementary Table SI online. 

Seventy-eight individuals were identified as homozygotes 
or compound heterozygotes for the following conditions: a-1- 
antitrypsin deficiency (n = 38), cystic fibrosis (n = 9), GJB2- 
related DFNB1 nonsyndromic hearing loss and deafness (n = 
6), factor XI deficiency (n = 5), Gaucher disease (n = 4), famil- 
ial Mediterranean fever (n = 3), carnitine palmitoyltransferase 
II deficiency (« = 2), medium chain acyl-CoA dehydrogenase 
deficiency (« = 2), sickle cell disease (n = 2), short chain acyl- 
CoA dehydrogenase deficiency (n = 2), achromatopsia (n = 1), 
(3-thalassemia (n = 1), hexosaminidase A deficiency («= 1), famil- 
ial dysautonomia (n = 1), lipoamide dehydrogenase deficiency (n 
= 1), Niemann-Pick disease type C (n = 1), Pompe disease (n = 
1), and spinal muscular atrophy (n = 1). The published specific- 
ity of the testing method suggests that these are "true positives" 
and merit further examination of clinical correlations. 12 Review 
of clinical notes found at least two individuals with previously 
known diagnoses, for Gaucher disease and deafness. Another 
individual reported a history of sickle cell disease but did not 
specify whether this was familial or personal. All others did not 
report diagnosis information in their test requisitions and selected 
the routine carrier screening indication. For asymptomatic 
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Table 1 Carrier statistics categorized by self-reported ethnicity 



ethnicity 


Screened, n 


Ethnicity as % of 
total group (%) 


n 


Carrier frequency 
Percentage (%) 


1 in 


All 


23,453 


100.0 


5,633 


24.0 


4.2 


African or African-American 


1,193 


5.1 


188 


15.8 


6.3 


Ashkenazi Jewish 


2,410 


10.3 


1,050 


43.6 


2.3 


East Asian 


1,121 


4.8 


95 


8.5 


11.8 


Finnish 


34 


0.1 


16 


47.1 


2.1 


French Canadian or Cajun 


193 


0.8 


63 


32.6 


3.1 


Hispanic 


2,302 


9.8 


423 


18.4 


5.4 


Middle Eastern 


512 


2.2 


76 


14.8 


6.7 


Mixed or other Caucasian 


6,744 


28.8 


1,650 


24.5 


4.1 


Native American 


48 


0.2 


3 


6.3 


16.0 


Northern European 


6,171 


26.3 


1,630 


26.4 


3.8 


Pacific Islander 


77 


0.3 


8 


10.4 


9.6 


South Asian 


1,123 


4.8 


144 


12.8 


7.8 


Southeast Asian 


462 


2.0 


59 


12.8 


7.8 


Southern European 


1,063 


4.5 


228 


21.4 


4.7 



individuals with conditions such as a- 1 -antitrypsin deficiency, 
results could be used to guide appropriate surveillance. 

Carrier rate variability by ethnicity 

Frequencies of positive results varied by ethnic group (Table 1). 
On average, 24.0% of individuals were positive for at least one 
condition. When stratified by self-reported ethnicity, this fre- 
quency ranged from 43.6% of Ashkenazi Jewish individuals 
to 8.5% of East Asians. For ethnic groups like the Ashkenazi 
Jewish, this frequency is unsurprising given the availability of 
screening for many conditions found in this group. 

Multiple-disease carriers 

Some individuals were heterozygous for multiple disorders 
(Table 2), with -5.2% (n = 1,210) found to be carriers of two 
or more disorders. Most were heterozygous for only two condi- 
tions (4.3% of all screenees and 83.9% of multiple-disease car- 
riers), although a small number were carriers of three (0.7 and 
13.8%) or more than three conditions (0.1 and 2.3%). Ashkenazi 
Jewish individuals were most frequently identified as multiple 
carriers, with 13.3% of all tested Ashkenazi Jews carrying more 
than one genetic disorder. These values are unsurprising, as 
population geneticists have long known that individuals carry 
on average 4-5 recessive lethal alleles. 10,15 Put another way, the 
average number of positive results for recessive lethals in this 
panel is only (0 x 0.694 + 1 x 0.244 + 2 x 0.053 + 3 x 0.008 + 
4x 0.001) = 0.378; sequencing the entire genome of each patient 
would reveal ~ 10 times as many lethal recessives on average. 

Carrier couples 

Some pairs of patients identified themselves as couples. In this 
set, we found several "carrier couples" in which both partners 
were heterozygous carriers for the same condition. Table 3 lists 
disorders identified in carrier couples at elevated risk of passing 
the carried disorder to their children. 



Comparison to traditional screening guidelines 

Among carrier states we detected, 76.7% were for diseases 
not included in ACOG carrier screening guidelines, whereas 
69.0% were for diseases not included in ACMG guidelines. The 
ACMG's inclusion of spinal muscular atrophy accounted for 
most of the discrepancy. 

We detected 433 individuals who would not have been identi- 
fied as disease carriers in accordance with conventional ethnic- 
ity-based screening paradigms (Table 4). For example, a non- 
Jewish carrier for familial dysautonomia would not be identified 
if following guidelines that recommend screening only for those 
with Jewish ancestry — 26.3% of familial dysautonomia carriers 
in our dataset did not report Jewish ancestry. Rates of non-Jew- 
ish carriers are even higher, an average of 40.7%, for diseases 
recommended by the ACMG. Furthermore, in Table 5, we show 
the 10 diseases with the highest carrier frequency for each popu- 
lation with more than 500 tested individuals, along with ACOG/ 
ACMG screening recommendations. In general, many prevalent 
diseases in each population are not currently recommended for 
screening by either the ACOG or the ACMG. 

DISCUSSION 

Data quality and limitations 

This study represents the first large-scale analysis of patients 
undergoing carrier screening for an extensive list of recessive 
disorders in a clinical setting. As such, the data provide the first 
report of carrier frequencies for many rare disorders in mul- 
tiple ethnic groups and are an important resource for guiding 
diagnosis, treatment, and prevention of Mendelian disease. 

Because our assay was based on targeted genotyping, we 
were able to find only carriers of particular disease-causing 
mutations, not all carriers of each disease. (Patient reports 
explain that genotyping can only establish risk reduction, not 
risk elimination.) Consequently, the disease frequencies we 
report should be considered lower bounds, as there are other 
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Table 2 Carriers of multiple diseases categorized by self-reported ethnicity 

Carrier, one Carrier, two Carrier, three Carrier, four or Carrier, two or 

disease diseases diseases more diseases more diseases 





f) 


% Total 8 




% Total 3 

/o i \j La i 




% Total 3 




% Total 3 

/o i xj La i 




% Total 3 


°/n f art* i pre 

/O V_ CI 1 1 1 C 1 J 


All 


A,A2S 


18.9 


1,015 


4.3 


167 


0.7 


28 


0.1 


1,210 


5.2 


21.5 


Mixed or other 
Caucasian 


1,315 


19.5 


289 


4.3 


39 


0.6 


7 


0.1 


335 


5.0 


20.3 


Northern 
European 


1,320 


21.4 


266 


4.3 


42 


0.7 


2 


0.0 


310 


5.0 


19.0 


Ashkenazi Jewish 


729 


30.2 


244 


10.1 


61 


2.5 


16 


0.7 


321 


13.3 


30.6 


Hispanic 


367 


1 5.9 


48 


2.1 


8 


0.3 


0 


0.0 


56 


2.4 


13.2 


African or 
African-American 


144 


12.1 


41 


3.4 


2 


0.2 


1 


0.1 


44 


3.7 


23.4 


South Asian 


109 


9.7 


31 


2.8 


3 


0.3 


1 


0.1 


35 


3.1 


24.3 


East Asian 


79 


7.0 


13 


1.2 


2 


0.2 


1 


0.1 


16 


1.4 


16.8 


Southern European 


180 


16.9 


43 


4.0 


5 


0.5 


0 


0.0 


48 


4.5 


21.1 


Middle Eastern 


62 


12.1 


11 


2.1 


3 


0.6 


0 


0.0 


14 


2.7 


18.4 


Southeast Asian 


44 


9.5 


14 


3.0 


1 


0.2 


0 


0.0 


15 


3.2 


25.4 


French Canadian 
or Cajun 


54 


28.0 


8 


4.1 


1 


0.5 


0 


0.0 


9 


4.7 


14.3 


Pacific Islander 


7 


9.1 


1 


1.3 


0 


0.0 


0 


0.0 


1 


1.3 


12.5 


Native American 


2 


4.2 


1 


2.1 


0 


0.0 


0 


0.0 


1 


2.1 


33.3 


Finnish 


11 


32.4 


5 


14.7 


0 


0.0 


0 


0.0 


5 


14.7 


31.3 



3 The "% Total" refers to the percentage of individuals in that row's population carrying the specified number of diseases. For example, of all screened 
Northern Europeans, 1 ,320 or 21 .4% carried one disease, 266 or 4.3% carried two diseases, and 5.0% carried two or more diseases. Of all carriers 
who were Northern European, 19.0% carried two or more diseases. 



Table 3 Carrier couples identified 
Disease 



Number of 
couples 



a-1 -Antitrypsin deficiency 47 

Cystic fibrosis 27 

Spinal muscular atrophy 15 

Sickle cell disease 10 

Familial Mediterranean fever 5 

GJB2-related DFNB1 nonsyndrome hearing loss and 

deafness 5 

Smith-Lemli-Opitz syndrome 3 

Factor XI deficiency 3 

Familial dysautonomia 2 

(3-Thalassemia 2 

Gaucher disease 2 

Tay-Sachs disease 

Sulfate transporter-related osteochondrodysplasia 
Medium chain acyl-CoA dehydrogenase deficiency 
Short chain acyl-CoA dehydrogenase deficiency 
Carnitine palmitoyltransferase II deficiency 
Mucolipidosis IV 



Carrier couples are defined as pairs of individuals identified as partners 
by self-report, both sharing heterozygosity for at least one disease. This 
table likely represents a subset of carrier couples we have identified, 
because most were tested as individuals and we therefore did not know 
who the partner was. 



disease-causing alleles not surveyed here. Further examination 
of each individual disease on an allelic basis will help elucidate 
important data points such as overall carrier frequency, disease 
penetrance, and clinical sensitivity. 

Although our targeted genotyping data is not comprehen- 
sive, it is a significant improvement on carrier rate estimation 
based on disease incidence rates. Such estimation suffers from 
statistical challenges such as underreporting of mild pheno- 
types and embryonic/child mortality affecting the assump- 
tions of Hardy- Weinberg equilibrium. 24,29,30 In contrast, our 
data come from directly assaying carrier genotypes rather than 
relying on mapping from phenotypic incidence to genotype. 

Comparison to previously published carrier frequencies 

Table 5 shows, for each ethnic group with n > 500 in our data 
set, the 10 most commonly detected diseases, along with our esti- 
mates and literature estimates for carrier frequency. For a handful 
of disease/ethnicity pairings, we were unable to find a previously 
published estimate. For most diseases, particularly in the well- 
studied Northwestern European and Ashkenazi Jewish popula- 
tions, our estimates are similar to previously published estimates 
of carrier frequency. However, there are some notable outliers. 

We observed significantly higher carrier frequencies than 
expected for severe diseases in several populations. In particu- 
lar, we find a cystic fibrosis carrier frequency of 1 in 40 among 
south Asians (95% confidence interval 1 in 29-64), in marked 
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Table 4 Carrier statistics among ACOG- and ACMG-targeted populations 

Targeted ethnic group statistics 8 Pan-ethnic statistics b 







Carrier frequency 
in targeted population 


Carrier frequency 
in all populations 


Carriers not from targeted 
ethnic group 


Disease 


Population 


n Carriers/ 
n total 


n Carriers/ 
n tested 


n Nontargeted carriers/ 
n total carriers 


0/ 

/o 


Sickle cell disease 
(Hb S mutation) 1 


African-American 


89/1,121 


145/21,360 


56/145 


38.6 


|3-Thalassemia c 


African-American 


27/1,121 


163/21,443 


72/1 63 


AA 1 




Southern European 


1 7/999 










South Asian 


33/1,068 










Southeast Asian 


14/417 








Canavan disease cd 


AJ" 


43/2,384 


71/21,581 


ZO/ / I 




Familial dysautonomia cd 


AJ 


56/2,385 


76/21,592 


ZU//D 


ZO. J 


Tay-Sachs disease/ 

Hpy A Hpf iripnru 

1 ICA rA LI CI IUCI l*-y 


AJ 


90/2,386 


151/21,985 


61/151 


40.4 


Fanconi anemia group C 


AJ 


24/2,385 


44/21,476 


20/44 


45.5 


Niemann-Pick type A d 


AJ 


25/2,385 


33/21,476 


8/33 


24.2 


Bloom syndrome d 


AJ 


23/2,384 


35/21,466 


12/35 


34.3 


Mucolipidosis IV d 


AJ 


18/2,385 


36/21,476 


18/36 


50.0 


Gaucher disease d 


AJ 


142/2,384 


280/21,473 


138/280 


49.3 


Cystic fibrosis cd 


Caucasian 


521/14,155 










AJ 


90/2,402 










African-American 


9/1,188 




N/A — pan-ethnic screen 






Asian 


16/2,777 










Hispanic 


33/2,294 








Spinal muscular atrophy d 


Caucasian 
AJ 


285/14,005 
40/2,371 










African-American 


10/1,178 




N/A — pan-ethnic screen 






Asian 


34/2,745 










Hispanic 


28/2,274 









ACMG, American College of Medical Genetics; ACOG, American Congress of Obstetrics and Gynecology; AJ, Ashkenazi Jewish, 
statistics for ethnic groups targeted for screening by ACOG and ACMG guidelines. Statistics for all 23,453 individuals. 'Conditions included in ACOG 
guidelines. Conditions included in ACMG guidelines. e Cystic fibrosis analysis here accounts for only 23 mutations recommended by ACMG. Our total 
panel includes 100 mutations and these data are reported in Supplementary Table S1 online. 



contrast to the published 1 in 118 rate. 21 This corroborates 
reports that cystic fibrosis is under-reported in the South Asian 
population. 22 Another notable outlier in both Ashkenazi Jews 
and East Asians is carnitine palmitoyltransferase II deficiency, 
which appears as a notably frequent disease ( 1 in 43 in Ashkenazi 
Jews and 1 in 378 in East Asians), although the literature marks 
it as very rare. 23 Finally, we detect Smith-Lemli-Opitz syn- 
drome (SLOS) at a significantly higher than expected rate in 
all European populations, in Hispanics, and Middle Easterners 
(1 in 123 from literature vs. 1 in 68 here; 95% confidence inter- 
val 1 in 60-78). Prior work suggests that a higher carrier fre- 
quency for SLOS than that computed from its birth prevalence 
is reasonable, because mutations in DHCR7, the causative gene 
for SLOS, may cause significant fetal mortality. 24 

In the African- American population, we observe significantly 
reduced carrier frequencies relative to literature statistics 
for 7 of the top 10 diseases, other than sickle cell disease, 



a- 1 -antitrypsin deficiency, and cystic fibrosis. Comparison 
to prior data in this population is confounded by significantly 
varying levels of ethnic admixture in the African-American 
population; it is possible that our study population is genetically 
distinct at these loci relative to prior studies. 16 Although we have 
not attempted to estimate admixture proportions in our study 
patients, it may be an interesting avenue for future research. 

Similarly, in the East Asian population, we find much lower 
carrier frequencies for Pendred syndrome (1 in 252 vs. 1 in 
51) and Pompe disease (1 in 366.5 vs. 1 in 112). We suspect 
two independent causes for the discrepancy. First, general- 
population screening data are not available for these diseases 
in this population; consequently, statistics are often derived 
from targeted study populations enriched in the mutations in 
question. 25 Second, the East Asian population displays suffi- 
cient genetic diversity (e.g., population structure among Han 
Chinese, Koreans, and Japanese) to have confounded results 
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Table 5 Top 10 most common carrier frequencies by population 



Counsyl 



Counsyl 95% 



Literature 



Disease 


frequency (1 in) 


CI (1 in) 


frequency (1 in) 


# Tested 


Screening? 


All populations (W = 23,453) 












a-1 -Antitrypsin deficiency 


13.1 


12-14 


11.5 


1 5,484 


X 


Cystic fibrosis 


27.8 


26-30 


31.7 


23,369 


Y 


DFNB1 


42.6 


39-47 


42.8 


15,799 


X 


Spinal muscular atrophy 


57.1 


52-63 


54 


23,127 


1 


Familial Mediterranean fever 


64.2 


57-73 


Unknown 


15,854 


X 


Smith-Lemli-Opitz syndrome 


68.2 


60-78 


123 


15,825 


X 


Sickle cell disease/|3-thalassemia 


69.6 


63-78 


158 


21,360 


X 


Gaucher disease 


76.7 


69-87 


200 


21,473 


X 


Factor XI deficiency 


92.0 


80-108 


Rare 


15,724 


X 


Achromatopsia 


97.5 


85-1 1 5 


123 


15,798 


X 


Northwestern Europe (N = 12,915) 












a-1 -Antitrypsin deficiency 


10.2 


10-11 


11.4 


8,570 


X 


Cystic fibrosis 


22.9 


21-25 


28 


12,870 


Y 


DFNB1 


46.0 


40-54 


33.2 


8,735 


x 


Spinal muscular atrophy 


49.9 


45-57 


47 


12,730 


1 


Smith-Lemli-Opitz syndrome 


50.3 


44-59 


123.9 


8,750 


x 


MCAD deficiency 


73.5 


62-90 


61.7 


8,749 


X 


Hereditary fructose intolerance 


90.1 


75-113 


81.1 


8,744 


X 


Achromatopsia 


91.0 


76-114 


123 


8,734 


X 


Hereditary thymine-uraciluria 


96.4 


80-122 


33 


8,680 


x 


Familial Mediterranean fever 


99.5 


82-126 


Unknown 


8,756 


X 


Ashkenazi Jewish (N - 2,410) 












Factor XI deficiency 


12.6 


11-15 


8 


1,501 


X 


Familial Mediterranean fever 


13.2 


11-16 


10.5 


1,513 


X 


Gaucher disease 


16.8 


14-20 


17 


2,384 


1 


DFNB1 


21.3 


17-28 


21 


1,509 


X 


Cystic fibrosis 


21.6 


18-27 


29 


2,402 


Y 


a-1 -Antitrypsin deficiency 


24.2 


19-32 


16 


1,452 


X 


Hexosaminidase A deficiency 


26.9 


22-34 


27.4 


2,366 


Y 


SCAD deficiency 


29.1 


23-40 


Unknown 


1,511 


X 


Familial dysautonomia 


41.8 


33-57 


31 


2,385 


Y 


CPT II deficiency 


43.2 


33-65 


Rare/too few 
patients worldwide 


1,512 


X 


Hispanic (N =2,302) 












a-1 -Antitrypsin deficiency 


9.1 


8-11 


9.2 


1,608 


X 


Cystic fibrosis 


52.1 


40-74 


59 


2,294 


Y 


DFNB1 


67.6 


48-114 


100 


1,623 


X 


Spinal muscular atrophy 


81.2 


59-130 


68 


2,274 


1 


Sickle cell disease/f3-thalassemia 


83.1 


60-138 


128 


2,077 


X 


Pompe disease 


134.7 


85-326 


100 


1,616 


X 


CDG-la 


135.4 


86-328 


Unknown 


1,625 


X 


Smith-Lemli-Opitz syndrome 


135.5 


86-328 


No reliable data 


1,626 


X 


Familial Mediterranean fever 


136.6 


86-331 


Unknown 


1,639 


x 


Phenylalanine hydroxylase deficiency 


162.8 


99^162 


Unknown 


1,628 


x 



Confidence intervals computed as Wilson 95% score interval. Some listed diseases overlap in definition in terms of genetic variants. Specific variants tested 
for each disease listed here are given in Supplementary Table B. Please refer to Supplementary Table C for citations to previous literature. CDG-la, congenital 
disorder of glycosylation type la; CI, confidence intervals; CPT II deficiency, carnitine palmitoyltransferase II deficiency; DFNB1 , G./B2-related DFNB 1 nonsyn- 
dromic hearing loss and deafness; MCAD deficiency, medium chain acyl-CoA dehydrogenase deficiency; MLC, megalencephalic leukoencephalopathy with 
subcortical cysts; SCAD deficiency, short chain acyl-CoA dehydrogenase deficiency; Y, both ACMG and ACOG recommend offering screening to this popu- 
lation; 1, one of ACMCVACOG recommend offering screening to this population; x, neither ACMG/ACOG recommend offering screening to this population. 

Table 5 Continued on next page 
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Table 5 Continued 



Counsyl 



Counsyl 95% 



Literature 



Disease 


frequency (1 in) 


CI (1 in) 


frequency (1 in) 


# Tested 


Screening? 


African American (W = 1 , 1 93) 












Sickle cell disease/p-thalassemia 


9.7 


8-12 


10 


1,121 


1 


a-1 -Antitrypsin deficiency 


29.2 


21-50 


37.3 


672 


X 


Cystic fibrosis 


74.2 


50-149 


84 


1,188 


Y 


Pompe disease 


112.5 


61-800 


59.7 


675 


X 


Spinal muscular atrophy 


117.8 


72-334 


72 


1,178 


1 


Galactosemia 


169.5 


82-°° 


94 


678 


X 


Gaucher disease 


213.2 


109-4,770 


34.6 


1,066 


X 


DFNB1 


226.0 


99-°° 


25.3 


678 


X 


Hereditary fructose intolerance 


226.0 


99-°° 


Unknown 


678 


X 


Smith-Lemli-Opitz syndrome 


339.0 


127-°° 


137.8 


678 


X 


South Asia (N= 1,123) 












Achromatopsia 


23.5 


18-35 


123 


798 


x 


Sickle cell disease/p-thalassemia 


30.1 


23-45 


25 


1,055 


1 


Cystic fibrosis 


40.1 


29-64 


118 


1,122 


Y 


Spinal muscular atrophy 


73.3 


48-1 53 


52 


1,099 


1 


Hereditary thymine-uraciluria 


99.8 


58-371 


33 


798 


X 


DFNB1 


99.9 


58-372 


148 


799 


X 


Citrullinemia type 1 


212.0 


93-°° 


Unknown 


636 


X 


MLC 


212.0 


93-°° 


Unknown 


636 


X 


Biotinidase deficiency 


266.3 


116-°° 


123 


799 


X 


Tyrosinemia type I 


266.3 


116-°° 


173 


799 


X 


Eastern Asia (N = 1,121) 












DFNB1 


22.2 


17-34 


33-200 


756 


X 


Sickle cell disease/p-thalassemia 


78.0 


50-178 


50 


1,014 


1 


Spinal muscular atrophy 


85.5 


55-195 


59 


1,111 


1 


Gaucher disease 


96.9 


59-275 


Unknown 


969 


X 


Achromatopsia 


189.0 


91-°° 


123 


756 


X 


Cystic fibrosis 


223.6 


115-5011 


242 


1118 


Y 


a-1 -Antitrypsin deficiency 


249.0 


109-°° 


450 


747 


X 


Pendred syndrome 


252.0 


110-°° 


51 


756 


X 


Pompe disease 


366.5 


137-°° 


112 


733 


X 


CPT II deficiency 


378.0 


142-°° 


Rare/too few 
patients worldwide 


756 


X 


Southern Europe (W = 1,063) 












a-1 -Antitrypsin deficiency 


17.0 


13-24 


13.1 


715 


X 


Cystic fibrosis 


35.3 


26-55 


28 


1,059 


Y 


DFNB1 


37.2 


26-67 


33.2 


745 


X 


Spinal muscular atrophy 


40.3 


29-66 


47 


1,049 


1 


Sickle cell disease/p-thalassemia 


52.4 


36-97 


50 


996 


1 


Familial Mediterranean fever 


74.5 


46-211 


Unknown 


745 


X 


Phenylalanine hydroxylase deficiency 


74.6 


46-211 


50.5 


746 


X 


Pompe disease 


81.9 


49-260 


100 


737 


X 


Smith-Lemli-Opitz syndrome 


82.9 


49-263 


Unknown 


746 


x 


Hereditary thymine-uraciluria 


105.1 


59-495 


33 


736 


X 



Confidence intervals computed as Wilson 95% score interval. Some listed diseases overlap in definition in terms of genetic variants. Specific variants tested 
for each disease listed here are given in Supplementary Table B. Please refer to Supplementary Table C for citations to previous literature. CDG-la, congenital 
disorder of glycosylation type la; CI, confidence intervals; CPT II deficiency, carnitine palmitoyltransferase II deficiency; DFNB1 , G_/B2-related DFNB 1 nonsyn- 
dromic hearing loss and deafness; MCAD deficiency, medium chain acyl-CoA dehydrogenase deficiency; MLC, megalencephalic leukoencephalopathy with 
subcortical cysts; SCAD deficiency, short chain acyl-CoA dehydrogenase deficiency; Y, both ACMG and ACOG recommend offering screening to this popula- 
tion; 1,one of ACMG/ACOG recommend offering screening to this population; x, neither ACMG/ACOG recommend offering screening to this population. 
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Table 5 Continued 

Counsyl Counsyl 95% Literature 

Disease frequency (1 in) CI (1 in) frequency (1 in) # Tested Screening? 

Middle East (N= 51 2) 



Familial Mediterranean fever 


24.5 


1 7-49 


Vsrisblt? 
(1/10-1/20) 


392 


x 


Sickle cell disease/p-thalassemia 


5.1 


31-165 


30 (but variable) 


469 


X 


Hereditary thymine-uraciluria 


54.9 


31-256 


33 


384 


X 


Achromatopsia 


55.1 


31-257 


123 


386 


X 


Cystic fibrosis 


63.1 


37-234 


91 


505 


Y 


Spinal muscular atrophy 


72.4 


41-340 


25 


507 


1 


DFNB1 


77.6 


40-1 627 


83 


388 


X 


Inclusion body myopathy 2 


96.5 


47-oo 


1 5 (Iranian Jews), 
unknown in others 


386 


X 


Hereditary fructose intolerance 


97.0 


47-co 


Unknown 


388 


X 


Smith-Lemli-Opitz syndrome 


129.3 


57-co 


Rare 


388 


X 



Confidence intervals computed as Wilson 95% score interval. Some listed diseases overlap in definition in terms of genetic variants. Specific variants tested 
for each disease listed here are given in Supplementary Table B. Please refer to Supplementary Table C for citations to previous literature. CDG-la, congenital 
disorder of glycosylation type la; CI, confidence intervals; CPT II deficiency, carnitine palmitoyltransferase II deficiency; DFNB1 , G_/B2-related DFNB 1 nonsyn- 
dromic hearing loss and deafness; MCAD deficiency, medium chain acyl-CoA dehydrogenase deficiency; MLC, megalencephalic leukoencephalopathy with 
subcortical cysts; SCAD deficiency, short chain acyl-CoA dehydrogenase deficiency; Y, both ACMG and ACOG recommend offering screening to this popula- 
tion; 1, one of ACMG/ACOG recommend offering screening to this population; x, neither ACMG/ACOG recommend offering screening to this population. 



in earlier genetic studies. 26 Our self-reported ethnicity data do 
not have sufficient detail to identify this structure. 

We note that direct comparison between our results and 
the published literature can be difficult due to previously 
mentioned statistical considerations and differing population 
substructures. As an example, carrier frequencies in Middle 
Eastern subpopulations reported in previous literature can 
vary widely and could also be substantially different from fre- 
quencies in Americans of Middle Eastern origin. Therefore, 
the references in Table 5 are not meant to present an exhaus- 
tive review but to provide context for comparison. The specific 
ethnic distributions merit further examination because robust 
data are unavailable for many diseases. 

Comparison to ACOG/ACMG guidelines 

ACOG and ACMG guidelines aim to identify couples with a 
minimum a priori carrier risk, based on ethnic background or 
general-population disease prevalence. The ACOG recommends 
at least a subset of patients be offered carrier screening for CF, 
hemoglobinopathies, Tay-Sachs disease, familial dysautonomia, 
and Canavan disease. 1,2,6-8 ACMG recommendations add SMA, 
Fanconi anemia Group C, Niemann-Pick disease type A, Bloom 
syndrome, mucolipidosis IV, and Gaucher disease type 1 . 

For their intended purposes, the ACOG and ACMG guide- 
lines serve well. However, our results demonstrate that most of 
the heterozygous states we identified fall outside these guide- 
lines (Tables 4 and 5). Because the current thought is that the 
average individual is heterozygous for at least five recessive dis- 
eases, this is not surprising. 15 By comparison, screening most 
Caucasians only for CF, as is recommended by the ACOG, 
yields a 4% positive rate. Focusing on a smaller number of con- 
ditions and on targeted ethnic groups means that most indi- 
viduals at elevated risk are not identified. 



In particular, the data show that a number of severe 
Mendelian disorders are more prevalent than commonly 
understood (Table 5), are often present outside their charac- 
teristic ethnic groups (Table 4), and are not covered by current 
screening guidelines (Table 5). For example, we find a SLOS 
carrier frequency of 1 in 68.2 in our overall population, almost 
twice the previously reported 1 in 123 carrier frequency. 27 
Neither the ACOG nor the ACMG currently recommend 
offering screening for this severe condition. In contrast, the 
ACMG does recommend screening for SMA, with a similar 
worldwide carrier frequency of 1 in 57 (our data) or 1 in 54 
(literature). 28 Indeed, in the Northwestern European popula- 
tion, we find indistinguishable carrier frequencies for SLOS 
and SMA: 1 in 50.3 and 1 in 49.9, respectively. Carnitine pal- 
mitoyltransferase II deficiency presents another illustration of 
a disease with a frequency worthy of consideration of wider 
screening. In our Ashkenazi Jewish population, the carrier rate 
was higher than several other conditions currently included in 
screening guidelines (Table 5). 

Ethnicity-based screening model vs. universal screening 

Historically, screening has focused on a limited disorder list pri- 
marily determined by self-reported ethnic group. Others 1 ' 2,6 " 8,17 
have noted that this model will soon be inadequate, given: (i) 
2010 US Census data that demonstrate sharp increases in indi- 
viduals reporting mixed racial ancestry, particularly among the 
younger population approaching reproductive age; 18 (ii) limita- 
tions of, and patient preferences against, use of racial and ethnic 
categorization in medicine; 19,20 (iii) the possibility of unknown 
or unreported ancestry, due to limited family history knowledge, 
adoption, or other factors; and (iv) the decreasing cost of pan- 
ethnic screening due to advances in genomics. 11 Indeed, most 
participating clinics applied the same testing panel regardless of 
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ethnicity, phasing out the ethnicity-based screening paradigm 
in favor of a universal screening model. 

We have presented perhaps the most accurate measure- 
ments to date of carrier frequencies for hundreds of reces- 
sive Mendelian alleles using a large, ethnically diverse clinical 
sample. In contrast, prior data suffer from the statistical irregu- 
larities inherent in aggregating the results of multiple studies 
conducted under different protocols. Those irregularities result 
not from mistakes by the authors. Instead, the relative order- 
ing of disease frequencies can only be determined with confi- 
dence when they are all assessed on the same dataset and under 
the same conditions. This, to our knowledge, has never been 
accomplished on the scale of this study, where we have applied 
multiplex carrier testing to sample a large population at a large 
number of genetic loci simultaneously. 

Implications of widespread expanded carrier screening merit 
further study. These may include increased partner testing via 
screening or sequencing methods, higher demand for genetic 
counseling services and genetics education for nongenetics 
providers, cost-effectiveness, and patient interest and anxiety. 

Our data demonstrate that current ethnicity-based approaches 
of genetic screening are not optimally aligned with the real distri- 
bution of carrier frequencies for severe genetic disease. In partic- 
ular, we find that numerous conditions, not currently suggested 
for screening by the ACOG and the ACMG, are prevalent; and 
that other diseases currently recommended for screening only 
in certain populations, such as sickle cell disease and Canavan 
disease, are in fact widely distributed outside their "home" pop- 
ulations. We believe that these facts should be considered along- 
side the rapidly decreasing cost of multilocus genetic testing in 
the design of future carrier screening guidelines. 

SUPPLEMENTARY MATERIAL 

Supplementary material is linked to the online version of the paper 
at http://www.nature.com/gim 
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